CultureSampo: A National Publication System of Cultural Heritage on the Semantic Web 2.0
نویسندگان
چکیده
CULTURESAMPO is an application demonstration of a national level publication system of cultural heritage contents on the Web, based on ideas and technologies of the Semantic (Web and) Web 2.0. On the semantic side, the system presents new solutions to interoperability problems of dealing with multiple ontologies of different domains, and to problems of integrating multiple metadata schemas and cross-domain content into a homogeneous semantic portal. A novelty of the system is to use semantic models based on events and narrative process descriptions for modeling and visualizing cultural phenomena, and for semantic recommendations. On the Web 2.0 side, CULTURESAMPO proposes and demonstrates a content creation process for collaborative, distributed ontology and content development including different memory organizations and citizens. The system provides the cultural heritage contents to end-users in a new way through multiple (nine) thematic perspectives, based on semantic visualizations. Furthermore, CULTURESAMPO services are available for external web-applications to use through semantic AJAX widgets. 1 A Basis for Cultural Heritage on the Semantic Web In our view, a cross-domain semantic cultural heritage portal [1] should be built on three pillars: First we need a cross-domain content infrastructure of ontologies, metadata standards, and related services, that is developed and maintained on a global level through collaborative local efforts. Second, the process of producing ontologically harmonized metadata should be organized in a collaborative fashion, where distributed content producers create semantically correct annotations cost-efficiently through centralized services. Third, the contents should be made available to human end-users and machines thought intelligent search, browsing, and visualization techniques. For machines, easy to use mash-up APIs and web services should be available. In this way, the collaboratively aggregated, semantically enriched knowledge base can be exposed and reused easily as services in other portals and applications in the same vein as Google Ads or Maps. 1 http://code.google.com/apis/maps/ CULTURESAMPO is an operational online demonstration on a national Finnish level of implementing of such a semantic portal. The system consists of a national ontology infrastructure [2], content creation process, and an online portal [3]. In this paper, the main parts of CULTURESAMPO are first shortly overviewed. After this contibutions of the work are summarized w.r.t. related work. 2 Collaborative Ontology Infrastructure An integral part of CULTURESAMPO are the ontologies and services of the FinnONTO infrastructure [2]. The general idea here is to extend the generic, logic based W3C recommendations with domain specific ontologies in different domains. Most of the FinnONTO ontologies were developed by transforming nationally used thesauri into light weight ontologies in a semi-automatic process. In the FinnONTO model, the ontologies are developed in a distributed fashion by collaborating expert groups of different fields, and are mapped together to form a large national ontology called KOKO encompassing all domains. At the moment, KOKO includes an upper ontology YSO (20 600 concepts), a museum ontology MAO (6800 concepts), an agriforestry ontology AFO (5500 concepts), an applied art ontology TAO (2600 concepts) and a photography ontology VALO (1900 concepts). In addition, CULTURESAMPO also utilizes a geographical registry of 800 000 places in Finland, a spatiotemporal ontology of Finnish counties 1865–2007 [4], an ontology of persons and organizations, and ontologized international systems, such as the Iconclass, and the Union List of Artists Names (ULAN). 3 Distributed, Cross-domain Content Creation The contents of CULTURESAMPO come from 22 museums, archives, and libraries, most of which produce their contents independently from each other using heterogeneous cataloging systems and practices. The system also contains external web contents: all Wikipedia articles (in English and Finnish) that have coordinate information, as well as related photographs from the Panoramio service can be found on CULTURESAMPO’s map views. CULTURESAMPO also has an internal Web 2.0 commenting facility by which citizens can contribute new knowledge to individual content items, e.g. identify persons in an old photograph of a museum collection. In these ways, citizens are able to contribute to the “national semantic memory”. Furthermore, interactive content production based on the SAHA editor [5] has been used in the system by participating organizations. The system contains cultural objects of 67 different content types: artifacts, paintings, drawings, sculpture, abstract art, novels, comics, web pages, three types of folklore, five types of folk music, photos, aerial photos, persons, organizations, biographies, historical events, skills, videos, buildings, archeological sites, and others. The metadata is represented using 204 different properties and 253 additional properties used in ontologies. The content types harvested from the collections of the participating memory organizations are represented using 18 different 2 The portal prototype is publicly in use at http://www.kulttuurisampo.fi/. 3 http://www.iconclass.nl/ 4 http://www.getty.edu/research/conducting research/vocabularies/ulan/ metadata schemas. The aggregated knowledge base contains 134,000 cultural objects and 285,000 additional cultural resources (persons, places etc.), described by 2.7 million triples. The content is enriched using reasoning, that generates 8,7 million additional RDF triples in the knowledge base. From a semantic modeling viewpoint, a research focus of our work has been eventand process-based annotations used in artificial intelligence and knowledge representation [6]. Events have been used for modeling cultural processes and narrative stories; a simplified version of the model [7] is in use. An example of a cultural process is “Production of Ceramics” created by experts at the University of Applied Arts in Helsinki. It illustrates and explains the composition of different work phases when manufacturing ceramics. Our process model has also been applied to videos. A case example available on the portal describes how the shoe maker Onni Wirlander manufactured a pair of traditional leather boots. The video can be viewed directly starting from different points of interest. When watching the video, the recommendation system creates dynamically, for each subsequence separately, recommendation links to materials of interest in the portal, such as tools related to the sequence. Content creation in CULTURESAMPO is supported by a number of generic tools, such as the National Ontology Library Service ONKI, the metadata editor SAHA, information extraction tool POKA, and the semantic content validator VERA. 4 The Semantic Web 2.0 Portal CULTURESAMPO The portal is an end-user application for both 1) human users and 2) machines, and is multilingual (Finnish, Swedish, and English). A major novelty of the portal is to provide its contents through nine “thematic perspectives”: Map views. There are 4 map views [4] available using Google Maps: One for viewing all objects and filtering them in terms of their relation to the places; one for finding old Finnish counties with digitized limits; one for viewing historical maps layered semi-transparently over modern Google Maps ; one for finding near-by objects of interest. It is also possible to find Wikipedia articles and Panoramio photos by using the map interfaces. Relational search. This view is a demonstration of relational search [8], where the idea is not to search for objects but associative relation chains between objects. We used the ULAN registry of 120,000 artists and organizations with 390,000 names. The user types in two names, using semantic autocompletion, and CULTURESAMPO tells how the persons or organizations are related with each other by the social network based on some 50 different social roles (e.g., parent-of, teacher-of, patron-of etc.). The underlying social RDF/OWL network can also be browsed by a graphical network browser. Search and organize. In this view, a new kind of generalized version of the faceted-search paradigm has been developed, called domain centric search [9]. This advanced search and analysis facility is used for filtering contents and for visualising the results on facets, maps or a timeline. For example, it is possible to find out how the beard fashion, as depicted in art and photographs, has changed in the 19th century in Finland. 5 http://www.yso.fi/ 6 http://www.seco.tkk.fi/tools/poka/ 7 http://www.seco.tkk.fi/services/vera/ Collections view. Here the contents can be accessed based on an organizational view. Each participating organization has an automatically generated home page in the system with links to subcollections and the actual collection items. Finnish history view. This view is based on an ontology representing events in the Finnish history. These events are of interest of their own, but are also used to create semantic recommendations to other CULTURESAMPO contents, e.g. to biographies of persons participating in the events. Skills and processes. This view is used for finding cultural procedural descriptions in the system, i.e. semantic models of skills, processes, and documentations on videos. Biographies. In this view, biographies of the National Biography are used to access CULTURESAMPO contents. When reading biographies, related contents are shown based on the concepts extracted from the text using the information extraction tool POKA. Semantic Kalevala. This view [10] contains a semantically annotated version of the national epic of Finland, Kalevala, that is related in many ways to Finnish art and culture. The epic is based on old Finnish folklore; thousands of runes and pieces of folk music are available in the portal, too. When reading Kalevala, annotations related to its subsequences can be viewed to help reading, and semantic recommendations to related materials, such as art, artifacts etc. in CULTURESAMPO are automatically produced. Karelia view. This view contains Wikipedia articles about the Karelia area in Finland that has been influential to Finnish culture. Like in the biographies view, the POKA system is used for extracting concepts from the texts (here web pages) and for generating semantic recommendations. In addition to humans, the system can be used by machines via an AJAX interface. In this way, the collaboratively aggregated and semantically enriched national CULTURESAMPO knowledge base can be used by other portals and systems on the web, in the same spirit as Google Ads or Maps. 5 Discussion and Conclusions The vision and implementation of CULTURESAMPO goes beyond current semantic web portals for cultural heritage [1], such as MuseumFinland [11], MultimediaN 2006 [12], and CHIP Demonstrator 2007 [13]: The CULTURESAMPO system 1) is highly cross-domain with lots of content types and metadata schemas (usually only one schema such as Dublin Core or VRA is used), 2) it makes use of sophisticated semantic annotation models including events and processes, 3) it uses new kind of semantic search and recommendation techniques, 4) it has exceptionally versatile selection of semantic visualizations available (different map views, timelines, graphs, process visualization, semantic video viewing), 5) it is based on a large nation wide collaboratively maintained infrastructure of ontologies and ontology services, 6) it includes a model of and tools for collaborative semantic content creation, and 7) the services are available for machines, too. Acknowledgements This work is a part of the national FinnONTO research project 2003–2007, 2008–2010, funded mainly by Tekes and a consortium of 8 http://www.kansallisbiografia.fi/ 9 http://www.seco.tkk.fi/projects/finnonto/ 38 companies and public organizations, partly by the EU FP7 SmartMuseum project2008–2010, and by the Finnish Cultural Foundation (2008–2010).
منابع مشابه
CultureSampo—Finnish Culture on the Semantic Web 2.0: Thematic Perspectives for the End-user
We present an overview of CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage collections and other contents into a semantically rich ...
متن کاملTHE 1ST INTERNATIONAL SYMPOSIUM ON DIGITAL HUMANITIES FOR JAPANESE ARTS AND CULTURES (DH-JAC2009), RITSUMEIKAN 1 CultureSampo —Finnish Cultural Heritage Collections on the Semantic Web 2.0
This paper presents an overview of the SemanticWeb 2.0 application CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage content into a ...
متن کاملCultureSampo - Finnish Culture on the Semantic Web
This paper presents the semantic portal “CULTURESAMPO—Finnish Culture on the Semantic Web”1 [1, 2]. The portal provides memory organizations and other cultural content publishers with a national, shared semantic publication channel for heterogenous cultural contents. The content comes from over ten organizations and is annotated using various ontologies of the FinnONTO infrastructure [3]. For t...
متن کاملHow to deal with massively heterogeneous cultural heritage data - lessons learned in CultureSampo
This paper presents the CultureSampo system from the viewpoint of publishing heterogeneous linked data as a service. Discussed are the problems of converting legacy data into linked data, as well as the challenge of making the massively heterogeneous yet interlinked cultural heritage content interoperable on a semantic level. In the approach described, the data is published not only for human u...
متن کاملCULTURESAMPO –A Collective Memory of Finnish Cultural Heritage on the Semantic Web 2.0
This paper presents the SemanticWeb 2.0 application CULTURESAMPO, an ambitious system of creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic challenge of aggregating highly heterogeneous, cross-domain cultural heritage into a semantically rich intelligen...
متن کامل